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Recent top quark event modelling studies done using CMS proton-proton data collected at a centre 
of mass energies of 8 and 13 TeV and state-of-the-art theoretical predictions accurate to next-to- 
leading order QCD interfaced with PYTHIA8 and HERWIG++ event generators are summarised. 
The particle-level top quark (pseudo-top), underlying event measurement in ff events and parton 


shower tuning using ¢f events are discussed. 
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1. Introduction 


Top quark measurements provide important tests of QCD. Better understanding of perturbative 
and non-perturbative effects is required to obtain the highest possible precision in top quark mass, 
its interpretation and in other properties. Top quark measurements are also important to improve 
the accuracy of predictions in different phase space regions in searches for beyond the Standard 
Model (SM) effects. Precise top quark measurements are also be used in direct searches for new 
physics as well, e.g. with effective field theories expanding the SM Lagrangian. For these purposes, 
the uncertainties in the measurements and the predictions need to be at a level where deviations of 
the predictions of the Monte Carlo (MC) codes or deviations due to new physics effects be visible. 
State-of-the-art next-to-leading order (NLO) matrix element (ME) event generators interfaced to 
new parton shower (PS) codes used in LHC Run II may provide better modelling and eventually 
reduce the major theoretical uncertainties. The theory uncertainties can partially be tested and 
improved with datasets that allow differential measurements with well-defined top-quark objects. 
In this note, a selection of recent top quark event modelling studies from CMS[1] are discussed. 


2. Particle Level Top Quark 


Simulations at NLO take the finite width of the top quark into account which is important to 
accurately model the off-shell production of top quarks and their interference with the backgrounds. 
In these calculations the concept of top quark as a particle is not well-defined and MC dependent. 
One can only use the kinematics of the final-state particles unambiguously. A particle-level top 
quark (also called pseudo-top quark) can be constructed from the final-state objects after hadro- 
nisation. Using particle-level top quarks would yield smaller uncertainties from non-perturbative 
effects and from acceptance corrections because of the similar phase definitions at the particle and 
detector levels minimising MC dependence. The details of particle-level top quark definitions and 
their adoption in the RIVET [2] framework in the official CMS reconstruction code are discussed 
in [3] as a fundamental aspect for current and future measurements of differential production cross 
sections in both ¢f and single-top quark production. The results reported in [3] indicate that the 
particle-level top quark definition needs to be optimised depending on the production mode, the 
final state or the variable and the phase space being investigated. 


3. Underlying event and Parton shower tuning in tt events 


The CMS Run I combination of direct top quark mass measurements at 7 and 8 TeV, in lep- 
ton+tjets, dilepton, and all-hadronic channels yields a precision of 0.3% [4]. In this result, the 
dominant uncertainties are related to the event modelling. Therefore, to improve top quark mass 
measurements, dedicated measurements and theory studies are required. The the b quark from the 
top quark decay carries the colour flow. To become colourless, the b quark "connects" with the 
beam remnants or other coloured final particles produced in the event. A b jet in the final state 
can be constructed however the uncertainty in the origin of all the final states in the jet results in 
"odd clusters" (e.g. see [5]). Therefore, it is important to have an accurate description of b quark 
fragmentation and hadronisation, as well as UE is needed. 
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UE measurements use the highest pr charged-particle jet, the highest Er calorimeter jet, ad 
the Z-boson direction as the leading-object to define regions of 7 — @ space, in the toward, away, 
and transverse regions. The transverse region is particularly sensitive to the modelling of the UE. 
The PYTHIA8 tune CUETP8M1 and the HERWIG++ tune EESC [6] are reconstructed by fitting the 
UE data at several centre-of-mass energies, where the leading object is the highest pr charged 
particle or the charged-particle jet in the event. These tunes describe well the UE as measured in 
Z-boson production. However, very little is known about UE in heavy quark production. Ref. [7] 
compared detector-level top-quark production data at 13 TeV with the PYTHIA8 CUETP8M1 tune 
and the HERWIG++ EESC tune after detector simulation in tf enriched events in the lepton+jets 
channel. Both of the parton shower models are interfaced with POWHEG V2. Fair agreement is 
observed between POWHEG V2+PYTHIA8 CUETP8M1 tune predictions. It is also observed that UE 
is sensitive to QCD scales. Fig. 1a displays the charged-particle multiplicity when the PS scale is 
increased from its default value. A complete measurement of UE in tt events at particle-level may 
lead to more precise top quark mass with better understood systematic uncertainties. 

It is observed that the predictions of the NLO MC ME generators + PYTHIA8 CUETP8M1 tune 
[8] overshoot the v/s = 8 [9, 10] and 13 TeV [10, 11] data for large jet multiplicities when out of 
the box parameters are used, while all other distributions are modelled well (except top quark pr). 
The tune CUETP8M1 is based on the Monash tune [12]. Accurate predictions of this observable is 
particularly important in measurements of the Higgs boson and many new physics search analysis. 
To improve the description of high jet multiplicities in tt events, a number of parameters have been 
studied and the most sensitive ones to jet kinematics in tf events are selected and optimised. The 
strong coupling parameter at mz for initial-state radiation in the PS, gi , and the hgamp parameter 
that controls the jet matching in the POWHEG V2+PYTHIA8 [13, 14, 15] setup are tuned using 
Run 1 data on jet activity in t7 events. The Monash tune for œ? adopts the a/*% 


A ; > value tuned 


to LEP event shapes. This is found to be the main cause of overproduction of jets in the MC. 
We tuned the values of a/%* and hgamp Using the jet multiplicity and leading additional jet pr 
distributions in the dilepton final state measured at \/s = 8 TeV using the PROFESSOR tool [16]. 
In this procedure, all other PYTHIA8 parameters are kept fixed to the ones in CUETP8MI tune. It is 
observed that œ}? 
the leading additional jet pr. This is in agreement with the fact that the leading additional jet, in 
the POWHEG V2+PYTHIA8 configuration, stems from the real radiation calculated by the POWHEG 
v2 generator. The tuning procedure yields damp =1.581+9:838 x m, and ae =0.1108 ere: 


The tuned a/5* value agrees with the PDG value of a,(Mz) = 0.1181 +0.0011 [17] well within 
ISR 


impacts mostly Njers > 3, while hgamp affects the ratio of 2-to-3-jet events and 


uncertainties. Fixing Agamp to its default value of m,, a re-tuning of a;°" alone to the same data 


S 
yields a/5* =0.1 15 [18] again in agreement with the PDG value. POWHEG V2+PYTHIA8 with 
these optimised parameters cures the overshoot of the CUETP8M1 at high jet multiplicities. 

The jet activity mainly constrains those parameters that control the probability for parton emis- 
sion and the interplay between hard and soft parton emission. The jet activity, however, does 
not strongly constrain the global production of hadrons known as the underlying event (UE). 
Therefore, a/°* as determined from ff jet kinematics can be used as a fixed parameter to tune 
the UE. See ref. [10] for the details of the CUETP8M2T4 tune derived fixing a/5® to 0.1108. 
The performance of the CUETP8M2T4 PYTHIA8 tune is evaluated in different configurations. It 


is found that both POWHEG V2+PYTHIA8 and MGS_aMC@NLO + PYTHIA8 with FxFx merg- 


ing [19] describe the top quark data well (except for top quark pr independent of the tune), while 
MG5_aMC @NLO +PYTHIA8 with MLM matching [20] and the inclusive aMC @ NLO +PYTHIA8 does 
not describe the data in general (e.g. see Fig. 1b). It is also observed that the global event vari- 
ables such as Hr or Sr do not get modified significantly with the change of a/5® (except for 
MG5_aMC@NLO + PYTHIA8 [MLM] and aMC@NLO +PYTHIA8 independent of the tune for 
some variables such as the jet multiplicity) [10]. The comparisons of predictions of POWHEG 
V2+PYTHIA8 with the CUETP8M2T4 tune for six different differential cross-sections to the ones 
measured with 35.9 fb~! data at \/s = 13 TeV and yield an overall p-value from < 0.01 when 
theory uncertainties are ignored to 0.91 when theory uncertainties are included [21]. 
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Figure 1: The charged particle multiplicity distributions for the away, transverse and toward re- 
gions as well as for the overall sample (a). Distributions are obtained with the increased scale, i.e. 
(2Q)? that matches the data better. The points correspond to the data at the detector level and the 
lines represent the POWHEG V2+ PYTHIA8 predictions with the CUETP8M1 tune. Each distribu- 
tion is normalised to one. CMS data at 13 TeV on the normalised cross section in the lepton+jets 
channel, as a function of the number of additional jets (b). The data are compared with the predic- 
tions of POWHEG V2, MG5_aMC@NLO , either with MLM matching or FxFx merging, and with 
aMC @NLO interfaced with PYTHIA8 with CUETP8M1 and CUETP8M2T4 tunes. Also shown is the 
ratio of the theory and the data (theory/data), where the yellow band indicates the total experimental 
uncertainty of the data. 
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